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ABSTRACT 

We present non-parametric radial mass profiles for ten QSO strong lensing 
galaxies. Five of the galaxies have profiles close to p(r) oc r -2 , while the rest 
are closer to r _1 , consistent with an NFW profile. The former are all relatively 
isolated early- types and dominated by their stellar light. The latter — though the 
modeling code did not know this — are either in clusters, or have very high mass- 
to-light, suggesting dark- matter dominant lenses (one is a actually pair of merging 
galaxies). The same models give Hq 1 = 15.2l 2 'yGyr (Hq = 64^gkms _1 Mpc -1 ), 
consistent with a previous determination. When tested on simulated lenses taken 
from a cosmological hydrodynamical simulation, our modeling pipeline recovers 
both H Q and p(r) within estimated uncertainties. 

Our result is contrary to some recent claims that lensing time delays imply 
either a low Hq or galaxy profiles much steeper than r~ 2 . We diagnose these 
claims as resulting from an invalid modeling approximation: that small deviations 
from a power- law profile have a small effect on lensing time-delays. In fact, as we 
show using using both perturbation theory and numerical computation from a 
galaxy-formation simulation, a first-order perturbation of an isothermal lens can 
produce a zeroth-order change in the time delays. 

Subject headings: gravitational lensing, galaxies: halos 



Introduction 



In our current understanding of str ucture formation, ga laxies form from the dissipation 
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with a is numerically found to be in the range a ~ [1,1.5], b ut other parameterizations 
are also possible (e.g., Merritt et al. 2006 ; Graham et al. 2006al lbl). Over the visible region 



of galaxies (r a), the dark matter distribution tends towards a single power law with: 
p r~ a . At much larger radii (r ^> a), the distribution tends towards p ~ r~ 3 . 

While the distribution of dark matter is now relatively well understood, less clear is 
the effect of the gaseous component on the final total mass distribution of galaxies, sinc e 
simulations involving gas remain a significant technical challenge (see e.g., iMayerl 12004 ) . 
However, it is likely to be more complex than a simple sum of the gaseous, stellar and dark 
matter components, since the gas collapse will cause a contraction of the underlying dark 
matter distribution, increasing the ce ntral concentration of dark matter, an d possibly even 
making the halo more spherical (e.g., lYounglll980l ; iKazantzidis et al.ll2004l ). Despite these 



worries, the final resulting mass distribution is simpler to predict if the stars and gas form the 
dominant dynamical component of the galaxy - as is the case in the centre of massive spiral 
galaxies and ellipticals. There we ma y expect a n isothermal di stribution (p oc r~ 2 ), either 
as a result of equilibrium gas physics (jShulll99ll ). or relaxation (ILynden-Belll 1 1 96 71 ) . Indeed, 
dynamical measurements of massive galaxies suggest that an isothermal d i stribution provides 



an excellent fit over a wide range of radii (see e.g., Ivan der Marellll99ll ; iKronawitter et al. 



2000l ; iHumphrey et al.ll2006l ). At larger radii, however, we expect a transition from p ~ r -2 to 
p ~ r _1 as the dark matter halo becomes more and more dynamically dominant. Eventually, 
at very large radii (r ^> a) we expect a second transition to p ~ r -3 . 

In this paper, we test the above theoretical predictions by using strong lensing to de- 
termine the non-parametric density profile of ten QSO lensing galaxies, for the first time. 
Strong lensing measurements, which are limited in radius to the outermost observed image, 
probe radii r < a. Thus, we hope to probe the transition from p ~ r~ 2 to p ~ r _1 as we move 
from galaxies dynamically dominated by their stars, to galaxies dynamically dominated by 
their dark matter. We do not measure far enough out (r > a) to test the prediction of an 
eventual transition to p ~ r~ 3 . 



2. The method 



2.1. Searching lens models 



A given set of lensing observables, even with zero noise, is consistent with a variety 
of lensing mass distributions^ This is the well-known problem of lensing degeneracies. If 



^ee the Appendix for brief derivation of this and some other relevant results from lensing theory. 
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multiple source redshifts are available, as is the case in rich lensing cl usters, the effect o f 
degeneracies is minimal, enabling robust mapping of the mass profiles (jSaha et al.ll2006d ). 
For galaxy lenses, however, multiple source redshifts are very unlikely, and as a result de- 
generacies present a serious difficulty. The most important degeneracy couples time delays 
and the steepness of the mass profile: steeper mass profiles have higher values of (Hq At) for 
the same image positions and magnification ratios. 

Given the steepness degeneracy, one might hope that if At and H are both known, 
the steepness is constrained. But even that is no t guaranteed, because time delays are 
also i nfluenced by, for examp le, twisting ellipticity (IBernstein &: Fischer! Il999l ; IZhao fc Qin 
20031 ; ISaha &: WiUiamsll2006l ) implying further degeneracies. So it would appear that lensing 
observables constrain only some dreadful function of H , steepness, and shape. 

However, the situation is not so hopeless. The data may indeed allow a wide range of 
models, but by searching through the allowed 'model-space' and adding some prior informa- 
tion, one can make probabilistic inferences. To do so, one needs (1) a prior on model-space, 
and (2) an algorithm for sampling allowed models. 

The technique of pixelated lens models is one possible strategy for (1) and (2). The 
lens model is constructed as a superposition of mass pixels. It turns out that the data 
can be encoded as linear equations on the mass pixels - - though there are many more 
equations than there are pixels. Conservative but reasonable priors can be encoded as linear 
inequalities on the mass pixels. Specifically, in this work we require the projected density 
to (i) be non-negative, (ii) be inversion symmetric, meaning E(— Z ) = unless the 

galaxy is a known irregular, (iii) be centrally concentrated, with the local density gradient 
pointing at most 45° away from the center, (iv) have no pixel more than twice the sum of 
its neighbors, the central pixel excepted, and (v) have circularly averaged profile nowhere 
shallower than i? _7min - we will return to this constraint, and our cho ice of in a moment . 



The explicit equations a nd inequalities are given in early p apers (Saha fc Williams 



1997 



Abdelsalam et al.lll998aB ) for weak as well as strong lensing. I Williams fc Sahal (120001 ) then 
introduced the idea of uniformly sampling the model-space that satisfies all the equations 
and inequalities, resulting in a model-ensemble from which estimates and uncertainties can 
be derived. 



Saha & Williams ( 
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2006a); 



Still later improvements improve the statistical sampling in the ensemble. Consequently, the 
ensembles of 100 models which we use in this paper are actually better sampled that the 
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ensembles of 200 in Paper I because they eliminate clusters of strongly correlated models^. 
Several different tests have been done: 



1. recov ering Ho from mock data derived from simple parametrized lenses (IWilliams fc Saha 



2000); 



2. tests for biases in the model-sampling procedure (ISaha et al.ll2006bl ) 



3. comparing the inferred distributions of the dimensionless time delay <p [see Equa- 
tion |A8] for observed versus simulated lenses (Paper I). 



The technique does well in all of these tests, notwithstanding the seeming crudeness of th e 
prior. It also correctly predicts the morphology of Einstein rings (ISaha fc Williams! l200ll ). 
Later in this Section we show a further test, for simultaneous recovery of both Hq and the 
radial mass profile. 



Related ideas appear in other work: iKeeton fc Winnl (120031) use d an e nse mble of para- 
metric models to interpret a rare quintuple lens, whereas iBradac et al.l (120051 ) and lDiego et al. 
(120051 ) used free-form lenses but not model-ensembles. 



As in Paper I, we do not use flux ratio measurements. Tensor magnifications from 
fluxes and VLBI constrain time delays bet ween nearby images, . but r emarkably, they have 
no discernible effect on longer time delays (jRaychaudhury et al.ll2003l ). The physical reason 
is not hard to appreciate: magnification is essentially the second derivative of the arrival 
time surface (see Appendix I). Time delays between widely separated images tend to wash 
out the sort of local variations of density that cause differences in magnifications. The use of 



tensor magnifications to resolve substructure with pixelated models is studied in ISaha et al. 
(l2007r i. 



2.2. The ten-lens model-ensemble 

Paper I presented Ho derived from ten time-delay lenses: the quads J0911+055, B1608+656, 
and B1115+080, and the doubles B0957+561, B1104-181, B1520+530, B2149-274, B1600+434, 
J0951+263, and B0218+357. The results derived from an ensemble of 200 composite mod- 
els. Each composite model consisted of pixelated mass maps of all ten lenses, with a shared 
Hq. The value of Hq varied across the ensemble (as shown in the histogram in Figure 1 of 



2 These various developments will be described in detail in Coles (in preparation). See 
http://www.qgd.uzh.ch/programs/pixelens/ for the program itself. 
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Paper I), leading to an estimate with uncertainties. But within each composite model Hq 
was the same, thus coupling the data from different lenses. 

The coupling of different time-delay lenses through the shared Hubble parameter is 
key to constraining both the lenses and Hq in spite of degeneracies. Suppose we take from 
the ensemble a ten- lens model with a common Hq. Then we make all the lenses steeper, 
while increasing Hq, as permitted by the steepness degeneracy. We cannot do so indefinitely, 
because the steepness transformation (see Equation IA1 II) will eventually create a negative 
density in one of the lenses. Once we reach this point, we cannot increase Hq any more, 
nor can we make any of the lenses any steeper. At the other end, the prior assumes, 
conservatively, that the lenses are no shallower than E(-R) ~ R~ lmia . This means that if one 
lens is at i? _7min , H cannot get any lower and none of the lenses can get any shallower, 
even if they are nowhere near R~ 7min . The full picture is more complicated because of shape 
degeneracies and the details of the prior, but the coupling of different lenses through the 
shared Hubble parameter does indeed allow both H and lens profiles to be simultaneously 
constrained. 

In previous papers on time delay lensing galaxies (like Paper I), we used 7 min = 0.5. 
Here, where we have a heterogeneous sample of galaxies, and where we would like to probe 
shallower density distributions, it is not clear that this is the right prior to use. Therefore 
we experimented with changing this prior in the range 7 m i n = [0,0.5]. We start with the 
same prior as in Paper I: 7 min = 0.5. Then, we re-run the analysis for the full ensemble, 
but for any galaxy which had a density profile at or near 7 min = 0.5, we reduce the prior to 
7min = 0. This allows shallower galaxies to be as shallow as they like, without systematically 
biasing the sample space available to the steeper ones. Because of the weaker prior we use 
in this paper, we include a new determination of H in Figure [3j It is somewhat lower, but 
consistent within uncertainties, with the results from Paper I. 



Starting with the model ensemble, as detailed above, we then circularly average each 
projected-mass map to obtain a X(-R) and deproject by numerically solving the usual Abel 
integral equation: 



To evaluate the numerical derivative and then the integral, we linearly interpolate S(-R) up 
to the R of the outermost image and assume £ oc R~ 2 outside. Since we have an ensemble 
of models, we automatically derive uncertainties on p{r) as well. The technique is the same 



2.3. Deprojection 
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as in ISaha et al.l (J2006cJ), and the result is not sensitive to the assumed outer slope, or the 



R where that outer slope begins, provided the latter is not beyond the image region. 

The above procedure assumes spherical symmetry and a possible concern is that this may 
introduce significant systematic error. To test for this, we projected and then deprojected 
(using the above method) a triaxial halo taken from a cosmological iV-body simulation. We 
recovered the spherically averaged density distribution perfectly, within the errors. This 
demonstrates that our assumption of spherical symmetry in the deprojection algorithm is 
equivalent to spherically averaging a (mildly) triaxial halo — a practise which is already 
common in the simulation community. It is important to emphasise that we do not assume 
spherical symmetry in our mass model at any stage until this final deprojection. 



2.4. Test of the method 



We have tested the simultaneous recovery of p(r) and H using lenses derived from 
a galaxy-formation simulation. A test against a range of simulated galaxies, mimicking 
observed surveys in detail, is desirable. Unfortunately, galaxy simulations at the required 
resolution and including both dark-matter and gas dynamics are not yet numerous. So we 
simply generate several lenses out of one simulated galaxy. 



The simulated galaxy is taken from a hydrodynamical cosmological simulation by lMaccio et al 



(120061 ) . The galaxy is an El or E2 triaxial ellipti c al do minated by stars in the inner region, 
but with overall ~ 80% dark m atter iMaccio et al.l (120061 ) . By ray-tracing through this galaxy, 
as described in (jMacciol 120051 ) . we generated five mock lenses: three doubles and two quads. 
The image positions and time delays were fed into PixeLens, which then generated a model 
ensemble consisting of mass maps and inferred ifo-values. We experimented with changing 
the minimum steepness constraint: i? _7min in the prior (see section |27TI for a full description 
of our prior); the results were insensitive to changes in the range: 7 min = [0,0.5]. In the 
following tests, we show results for 7 min = - the weaker prior. 

Figure [1] shows the distribution of Hq 1 from the model ensemble in this test. We see 
that the correct value is well within the 68%-confidence region. 

Figure [2J left and middle panels, show the 2D projected mass map of the simulated 
galaxy and the recovered distribution for one of the quads, respectively. Recall that all five 
mock lenses had the same mass map, but our modeling codes did not know that; the results 
from the other four galaxies were nearly identical to those presented here, and we omit these 
for brevity. 
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Hubble constant (local units) 
100 50 




Hubble time (Gyr) 

Fig. 1. — The simulated Hq 1 as recovered from the model ensemble. The unbinned values 
give 13. Gyr. The correct value was 14 Gyr. 



Notice how well the projected mass map is recovered. We obtain the correct elongation 
along the ^-direction, while the contours of equal mass density are well-matched within the 
errors. Beyond the images (where we have no data) the mass distribution drops sharply and 
is not to be trusted; we do not use any data beyond the outer-most image in our deprojection. 

It is worth commenting a little on this plot. The mass map shown is the average of the 
full model ensemble; it represents the expectation value of the mass in each pixel. This does 
not mean, however, that it is not a genuine representation of the projected mass distribution. 
In this specific test here, PixeLens accurately recovers the input morphology and mass, within 
estimated uncertainties. In other similar ex amples in the lite rature, PixeLens has resolved 



substructure in lensing galaxies and clusters (jSaha et al.ll20071 ). and in one example, resolved 
an interacting pair of galaxies (see Paper I). 

The deprojection of the recovered mass map is shown in the right panel of Figure [2J 
The simulated galaxy density profile (dashed line) is well recovered within the errors (grey 
band). Over-plotted is a power-law fit to the recovered density distributi on, obtained usin g 



a standard Levenberg-Marquardt least-squares fitting technique (see e.g. iPress et al.lll992l ). 
Note that, although this deprojection assumes spherical symmetry, our mass model up until 
this point did not (see also section 12.31) . 
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Fig. 2. — Recovered density distribution of the simulated galaxy, derived from the PixeLens 
mass map of one of the quads. Left, projected mass map for the simulated galaxy; the 
units for the contours are in units of the critical density. Middle: recovered projected mass 
map from PixeLens (see text for details). The solid circles show the distribution of (fake) 
images. Right deprojected radial density distribution. The shaded gray region is the 68% 
confidence region. The oblique solid line is the best-fit power-law r~ a . The dashed curve 
is the actual spherically-averaged p(r). The plot is truncated at the projected radius of the 
outermost lens image. The vertical dot-dashed line to the left marks the resolution limit of 
the simulation — the apparent core to the left of this line is a result of the force softening and 
is not physical. 



3. Profiles of observed time-delay lenses 

Having tested the pipeline of recovering both Hq and p(r) on mock lenses generated 
from a recent galaxy-formation simulation, we now derive radial profiles for real lenses. 



The first part has already been done in ISaha et al.l (j2006al ) (Paper I), where a model 



ensemble for ten time-delay lenses was generated, to estimate Hq. However, in this work we 
use a weaker prior and so it is worth plotting our determination of Hq again. This is shown 
in Figure [3j the results are consistent with Paper I, and we recover Hq 1 = 15.2i^ 7 Gyr 
(H = 64.3^g km s _1 Mpc" 1 ). Next, we derive p(r), with uncertainties, from the model 
ensemble, as explained above. Finally we fit a power law r~ a to the non-parametric p(r), as 
in section [27^1 above. Such a simple least-squares power law fit does not do justice to the full 
ensemble mass distribution which our non-parametric method derives for each galaxy. We 
use it just as a convenient way of representing our results on a single plot (Figure [6]), and as 
a simple way to compare our results with theoretical expectations. Our detailed results are 
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Fig. 3. 

(#0 = 



H 1 as recovered from the ten lens ensemble. The unbinned values give 15.2^' 7 Gyr 
Mpcr 1 ). 



64.313 km s" 1 



shown in Figures H] and [5] 

Figure [6] then summarizes the log density slopes, a, for all ten lenses. Five are close 
to r -2 or slightly steeper, and five are shallower, clustering around ~ r _1 . Formal errors 
on a are shown by the error bars. These probably somewhat underestimate the actual 
uncertainties. 

The five systems close to r~ 2 are all relatively isolated galaxies (though nearby galaxies 
contributing some external shear are often present). Fr om estimates of the stellar mass 



through population synthesis models (IFerreras et al. 



20051 ) , all five have both stars and dark 



matter contributing to lensing. In other words, these are all galaxies where we would expect 
an isothermal mass distribution. 

The five shallower lenses are different. In B0957+561 and J091 1+055 the main lensing 
galaxy is in a cluster, while B0218+357 and B1104+180 have little starlight, suggesting a 
dark-matter dominated lens; in all of these cases we would expect a shallower profile than 
r~ 2 . Finally, in B1608+656 the lens consists of two interacting galaxies; here the density 
peak of the host galaxy will be averaged with the lower density material from the infalling 
system, hence here again we would expect a shallower profile than r~ 2 . 

It is worth adding an extra comment for B0218+357, which appears distinct from the 
other lenses. Our five steep lenses have the outer-most image at a projected radius of 
< lOkpc. By contrast, all of the shallow lenses have the outer-most image at > lOkpc, with 
the exception of B0218+357. B0218+357 has its outermost image at a projected radius of 
little over ~ 2kpc, while its profile is of intermediate steepness, with a ~ 1.5. It may be 
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Fig. 4. — Inferred radial profiles for the lenses. The left column shows the projected density 
E(i2). The middle column shows the enclosed projected mass M(R). The right column shows 
the deprojected density profiles p(r) along with power-law fits r~ a . The plots are truncated 
at the innermost and outermost image in each system. The gray bands are 68% uncertainties 
derived from the pixelated model ensembles and are probably a good representation of the 
uncertainties. The quoted uncertainties on a are formal la errors. These galaxies are all 
close to p(r) cx r~ 2 . Continued in Figure [51 
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Galaxy name 



Fig. 6. — The mean steepness, a (where p oc r a ), for the ten lensing galaxies. Formal errors 
are marked. 



that these peculiarities are the result of a poor determination of the optical centre for this 
lens, due its small size on the sky (?). Such errors are not currently taken into account in 
our models. We hope to investigate this lens in more detail in future work. 

We conclude that while the inferred steepness has a large uncertainty, the identification 
of nearly r~ 2 versus shallower profiles is a strong result. 



4. Comparison with previous work 



Some recent work has concluded that if measured time delays are to be consistent with 
Hq 1 ~ 14Gyr (Ho — 70kms _1 Mpc™ 1 ) t hen the galaxy mass profiles i nvolved must be sig- 



nificantly steeper than r~ 2 . In particular, iKochanek fc Schechterl (120041 ) find that isothermal 
lenses give H^ 1 = 20 ± 1 Gyr (H = 48 ± 3 km s" 1 Mpc -1 ) from the measured time delays. 
Dobke &: King! (120061 ) report that Ho = 71 kms -1 Mpc -1 is consistent with an r~ 2 profile 
within 3<x. However, when they separate the lens sample into doubles a nd quads, the 3a 



uncert ainties do not overlap, so it is not clear how to interpret their result. iKoopmans et al. 
( 120061 ) attempt to break the steepness degeneracy using a single velocity dispersion for each 
galaxy. They conclude that all of their sample (time delays are not available for those lenses) 
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Fig. 7. — Distribution of reduced time delays, S [see equation I A9j . for the simulated galaxy 
lens (solid histogram). To produce this plot we traced rays from a uniform distribution of 
sources in the source-plane and calculated the distribution of arrival times for their images 
in the image plane. Over-plotted are the similar results for an isothermal lens (solid vertical 
line) and a first-order perturbed isothermal lens (Equation [8], with e = 0.156^; dotted line). 
The latter two are calculated analytically in section [5j Since an isothermal lens has 5 = 1, 
this plot measures the deviation of time delays from isothermal. 



is consistent within observational errors with isother mal profiles. However, w hen the same 
method is applied to the time-delay lens B1115+ 080 (ITreu fc Koopmansll2002r) a profile sig- 
nificantly steeper than isothermal is inferred. See iDobke et al.l (120071 ) for a recent discussion 
of the difficulties and proposed solutions. 



Somewhat in contrast is the work of iFerreras et al.l (120051 ). who compared stellar-mass 
profiles derived from population synthesis models with total-mass profiles reconstructed using 
PixeLens for 18 early-type lensing galaxies, of which six had time-delay measurements (all 
also used in the present work). Although that paper does not discuss the steepness of the 
profiles as such, it finds no inconsistencies when the measured time-delays were imposed as 
constraints along with Hq 1 = 14 Gyr. Plausible dark-matter fractions are inferred for all the 
lenses, and the tilt of the fundamental plane is reproduced. 



The above studies all use different (though sometimes overlapping) samples of lenses. 
This is not a problem for pixelated models, since the method explicitly allows for heteroge- 
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neous samples. However, in studies that assume a simple form for lensing galaxies, sample 
selection becomes an issue and may be partly responsible for differences between results 
for H and density profiles presented in the literature so far. However, as we shall now 
demonstrate, the details of the mass model have a far more important effect. 

Figure [7J shows the distribution of reduced time delays H [see Equation IA9] for the sim- 
ulated lens with a uniform distribution of sources (solid histogram). Although the simulated 
lens is well- fit by an isothermal density distribution (Figure [2D, its distribution of reduced 
time delays is not. 



5. Time delays in a perturbed isothermal lens 

The unexpectedly broad histogram in Figure [7] is readily understood from a perturbative 
calculation, as we now show. 

Consider an isothermal lens with Einstein radius 6 E , with a source at (p, 0). As is well 
known, the arrival time at sky position (x, y) is 

t(x, y) = \{x- v? + \y 2 - W^ + z/ 2 - (3) 

Equating the gradient of r to zero gives the image positions (provided \p\ < 6 E ) 

x+=p + 9 E , x_=p- 9 E , y± = 0. (4) 

Here (x + , 0) is a minimum, (x_, 0) is a saddle point, and there is no third image because the 
potential is singular. For p = the two images merge into an Einstein ring. 

Plugging the image positions back into Equation we obtain the time delay 

T_-r + = 2p9 E . (5) 

Note that r + < r_. It is easily verified that the reduced time delay ( 1A9I) is unity. 

Now imagine that we add a perturbing potential 5$ to Equation ([3]). As is usual in 
perturbation theory, we Taylor expand about the unperturbed solution. 

5$± = e ± x + e'±y + 0(e 2 ) (6) 

where e± and e± are Taylor-expansion coefficients around the two images (x±,0). Equating 
the gradient of the perturbed arrival time r — 5$ to zero, we have to first order: 

x ± =p±6 E + e ±J y± = e' ± (l±6 E /p). (7) 
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We now write e for the mean of e± and neglect their difference. The numerator in the reduced 
time delay flA9[) remains 2p8-g, to leading order, whereas the denominator is now 2(p + e) #e- 
Hence we have 

V 

Now comes the key point. Since p can be small, e/p need not be small. Hence small 
perturbations of an isothermal lens can produce zeroth-order changes in the reduced time 
delay. 

In Figure [7] we have over-plotted the distribution of 5 for e = 0.15 (dotted line). For 
this computation, we had yfp uniformly distributed, corresponding to a uniform distribution 
of sources in a disk. There is a gap in the perturbed distribution around 5 = 1, which 
arises because \p\ < 6e- Otherwise, however, our simple first-order perturbative calculation 
qualitatively reproduces the distribution for the simulated galaxy, including the asymmetry. 

The sensitivi ty of the time de lays to perturbations of the lens has been noted in the 



literature before. IWitt et al.l (120001 ) studied perturbations of the isothermal lens, and found 



that a shear perturbation could produce a large change in the time delays. Similarly, lOguri 



( 120061 ) considered more general perturbations to the isothermal sphere and found that per- 
turbations to the isothermal potential can give large changes in the time delays - indeed 
our result is implicit in their Figure 1. However, these works did not stress the importance 
of co mparing perturbations to the time delays against perturbations to the image positions. 



Only iBlandford fc Narayanl (119861 ) appear to have commented directly on this: "It has been 
our experience that fairly small changes in the lensing potential may introduce much larger 
difference in the delays than in the image properties. " 



6. Conclusions 

Our main result on density slopes (shown in Figures H] and and summarized in Figure 
E]) is that five of the galaxies studied are close to p oc r~ 2 , whereas the other five are 
shallower and clustered around p cx r _1 . The former are all relatively isolated galaxies where 
stars and dark matter probably both contributing significantly to lensing. The shallow 
systems are different: B0957+561 and J091 1+055 lie in clusters; in B1104-181 the lens 
images are far out in the halo suggesting that this lens is probably dark-matter dominated; 
B02 18+357 has little starlight, suggesting also a dark-matter dominated lens, though the 
centroid is difficult to determine for this system; and finally B1608+656 is a pair of interacting 
galaxies. Thus the latter five are just those systems which we might have guessed would be 
shallower. Our technique is tested on a mock survey of five lenses generated from a galaxy- 
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formation simulation, from which H$ and the mass profile are both recovered within the 
claimed uncertainties (Figures [I] and [2]). 

A significant finding is that, when calculating time delays: 

a nearly-isothermal mass distribution 9^ a perfect isothermal lens. (9) 

We demonstrate this through a simple perturbation calculation showing how a small devi- 
ation from an isothermal potential can be amplified in the time delays. We also verify this 
effect in a simulated galaxy, where time delays from 0.5 to 2 (and, in rare cases, an arbitrarily 
large number) times the value expected for a isothermal lens can occur (Fig. [7]), depending 
on the source position. 



more recent modeling work (IBernstein &: Fischer 



Although fleetingly anti cipated by Blandf ord fc Naravanl (119861) and s uggested by some 



1999 



Zhao fc Qinll2003l : ISaha fc Williams 



20061 ). the assertion ([9]) is contrary to the majority of lens models so far. Such models assume 
that lensing time delays are completely determined by the image positions and radial power- 
law index. Once we drop that invalid approximation, the apparent contradictions between 
time delays and mass profiles much discussed in several recent papers (see Section @| are 
resolved. 
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A. Lensing theory: scales and degeneracies 

This paper makes use of a number of rather specialized results from lensing theory. 
These are well-known known to experts, but since the original derivations are spread over 
diverse papers and a long time span, we review them here. 



A.l. The arrival-time surface 

We start with the expression for the change in travel time for a virtual photon originally 
from a source at sky position S but deflected at the lens so that the observer sees it coming 
from sky-position 9{. 

ct(e i; e s ) = (1 + ^^t^Pi - °s) 2 - (1 + I no') in \e t - e'\ d 2 e'. (ai) 
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This is just Equation (2.6) from iBlandford fc Narayarj (119861 ) with all the dimensions put 
back in; as in that paper, £ is the sky-projected density, z L is the lens redshift, d Qh is the 
angular-diameter distance from observer to lens, and so on. The first term on the right is 
the geometrical path difference between a deflected and undeflected photon trajectory, and 
the last term is the general-relativistic delay. The above equation assumes that the lens is 
a) infinite simally thin (compared to d OL and d LS ), and b) lies in a representative patch of the 



Universe (IBlandford fc Narayarj Il986l ). Both of these are conservative assumptions. 



The light-travel time (1A1I) can be expressed in a useful dimensionless form by some 
substitutions. First, we have the scaled time variable 



T 



[1 + Z L 



-I -1 



ct. 



(A2) 



Each of d OL ,d os ,d LS is (c/H ) times a redshift- and cosmology- dependent factor of order 
unity. In other words 

t ~ H H (A3) 

or the time delay in units of the Hubble time. Next, we introduce the dimensionless distance 
factor 

d LS 

C LS — 7 



(A4) 



Then we consider 



(A5) 



which can be interpreted as £ in units of the critical density for sources at infinity. (In many 
papers, the c LS factor is absorbed inside the definition of k; that refers k to a particular z s .) 
Finally, we write an operator that solves Poisson's equation in two dimensions 



v- 2 /^^ J f{e')\n%-o'\d 2 e'. 



(A6) 



With all these substitutions we obtain 



r(0 u 9 S ) = \{0 X - 6 s f - 2c ls V- 2 k(^) 



(A7) 



For given S , the abstract surface t(0i) is called the arrival-time surface. It is abstract 
because real photons do not arrive from all T — they arrive only from 1 that make t(0 1 ) 
extremal (Fermat's principle). In other words, images appear where the arrival-time surface 
has a maximum, minimum, or saddle-point. 
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A.2. Dimensionless time delays 

Equation flA7j) is completely dimensionless if we measure angles in radians, which em- 
phasizes that a lens model k(&i) on its own is dimensionless. Scales in lensing enter thro ugh 
equations flA2j) . flA4[) and flA5j) . which are all model- independent. (See lNityanandalll990l . for 
an interesting discussion of this point.) 

The last statement may seem paradoxical. If lens models are dimensionless, how can a 
lens model help infer Hq? The resolution is that lens models relate two time scales — Hq 1 
and observable time delays — even though they have no time scales of their own. Because of 
this, we can speak of a dimensionless time delay even for an observed lens with a measured 
time delay, and this is very useful for comparing a heterogeneous set of lens es. Two way s of 
turning a measured delay At into a dimensionless number are suggested in ISahal (120041 ) . 



One possibility is to compute 



At 



(A* 



where where 81,82 are the lens-centric distances (in radians) of the first and last images to 
arrive, At is the observed time delay between them, and D is the factor (1 + z L )(d OL d os /d LS ) 
from equation (IA2I) — which recall is oc Hq . The factor ^(^1+^2)) with angles in radians, is 
approximately (exactly for an isothermal lens) the fraction of the sky covered by the Einstein 
ring. It turns out that ip lies in the range ^ [0, 2] for doubles and ~ [2,8] for quads. This 
property was exploited by lSaha et all (j2006al ) to compare observed and A^-body time delays. 



Another possibility is the reduced time delay 



At 



(A9) 



which has a w i der ra nge than (p but has the attractive property that for an isothermal lens 



Oguril (120061 ) uses 5 as a basis to compare the distributions of observed and model 



time delays; we also use this quantity in section [5] 



A. 3. Steepness and other degeneracies 



Let us rewrite the arrival time slightly as 

r = 2V- 2 (l-c hS K)-0 1 -0 s 



(AlO) 
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where we have used the fact that in two dimensions V 2 # 2 
since it has no optical effect. Now consider the rescaling 



4, and also discarded a S term 



t' = At, (1-cwO = A(l-£wc), 9' S = X0 S 



(AH) 



applied to Equation flAlOj) . where A is constant in the region of images. There is no effect 
on the image positions or relative magnifications — basically we are redefining the contour 
spacing in a contour map of the arrival-time surface (such as Blandford & Narayan's Figure 2) 
without altering the figure. However, time delays change because r is rescaled, while the 
total magnification changes because S is rescaled. Meanwhile, rescaling (1 — c ls k) means 
that the mass profile gets steeper (if A > 1) or shallower (if A < 1). 

Equation (lAllj) is t he wel l- know n steepness degeneracy§| The simple geometric deriva- 
tion given here i s from ISahal (120001 ). but it was first derived by a different method by 
Falco et all Jl985h . 



The steepnes s degeneracy is broken if there is a range of z s (lAbdelsalam et al.lll998bl ; 
Saha et al.ll2006d ). That is because a range of z s gives a range of c LS , and Equation ( lAllj) can 
only be applied if c LS is constant. Another way to break the degeneracy is to have number 
counts of weakly lensed objects (IDye et all 120021 ) because then the total magnification is 
constrained and S cannot be freely rescaled. However for galaxy lenses, neither of these 
routes is in practice available. Multiple sources at the same z s do not help here, since the 
rescaling (lAlip can be applied to multi ple sources, so even an Einstein ring does not break 
the degeneracy (ISaha fc Williamsl 120011 ). 



That the steepness degeneracy is a serious problem for galaxy lenses is now widely 
appreciated, and researchers agree that steeper mass profiles lead to larger values of (Ho At). 
But as our Figure [7] shows, significant degeneracies can persist even at fixed slope. These 
degeneracies appear to consist of rescalings of the type flAlip but with varying A. The details 
remain to be investigated. 
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